159 research outputs found

    Integrative analysis of mRNA expression and half-life data reveals trans-acting genetic variants associated with increased expression of stable transcripts

    Get PDF
    Genetic variation in gene expression makes an important contribution to phenotypic variation and susceptibility to disease. Recently, a subset of cis -acting expression quantitative loci (eQTLs) has been found to result from polymorphisms that affect RNA stability. Here we carried out a search for trans -acting variants that influence RNA stability. We first demonstrate that differences in the activity of trans -acting factors that stabilize RNA can be detected by comparing the expression levels of long-lived (stable) and short-lived (unstable) transcripts in high-throughput gene expression experiments. Using gene expression microarray data generated from eight HapMap3 populations, we calculated the relative expression ranks of long-lived transcripts versus short-lived transcripts in each sample. Treating this as a quantitative trait, we applied genome-wide association and identified a single nucleotide polymorphism (SNP), rs6137010, on chromosome 20p13 with which it is strongly associated in two Asian populations ( p =  4×10 −10 in CHB - Han Chinese from Beijing; p  = 1×10 −4 in JPT - Japanese from Tokyo). This SNP is a cis -eQTL for SNRPB in CHB and JPT but not in the other six HapMap3 populations. SNRPB is a core component of the spliceosome, and has previously been shown to affect the expression of many RNA processing factors. We propose that a cis -eQTL of SNRPB may be directly responsible for inter-individual variation in relative expression of long-lived versus short-lived transcript in Asian populations. In support of this hypothesis, knockdown of SNRPB results in a significant reduction in the relative expression of long-lived versus short-lived transcripts. Samples with higher relative expression of long-lived transcripts also had higher relative expression of coding compared to non-coding RNA and of RNA from housekeeping compared to non-housekeeping genes, due to the lower decay rates of coding RNAs, particularly those that perform housekeeping functions, compared to non-coding RNAs

    Ground State Robustness as an Evolutionary Design Principle in Signaling Networks

    Get PDF
    The ability of an organism to survive depends on its capability to adapt to external conditions. In addition to metabolic versatility and efficient replication, reliable signal transduction is essential. As signaling systems are under permanent evolutionary pressure one may assume that their structure reflects certain functional properties. However, despite promising theoretical studies in recent years, the selective forces which shape signaling network topologies in general remain unclear. Here, we propose prevention of autoactivation as one possible evolutionary design principle. A generic framework for continuous kinetic models is used to derive topological implications of demanding a dynamically stable ground state in signaling systems. To this end graph theoretical methods are applied. The index of the underlying digraph is shown to be a key topological property which determines the so-called kinetic ground state (or off-state) robustness. The kinetic robustness depends solely on the composition of the subdigraph with the strongly connected components, which comprise all positive feedbacks in the network. The component with the highest index in the feedback family is shown to dominate the kinetic robustness of the whole network, whereas relative size and girth of these motifs are emphasized as important determinants of the component index. Moreover, depending on topological features, the maintenance of robustness differs when networks are faced with structural perturbations. This structural off-state robustness, defined as the average kinetic robustness of a network's neighborhood, turns out to be useful since some structural features are neutral towards kinetic robustness, but show up to be supporting against structural perturbations. Among these are a low connectivity, a high divergence and a low path sum. All results are tested against real signaling networks obtained from databases. The analysis suggests that ground state robustness may serve as a rationale for some structural peculiarities found in intracellular signaling networks

    Alterations in hepatic miRNA expression during negative energy balance in postpartum dairy cattle

    Get PDF
    peer-reviewedBackground Negative energy balance (NEB), an altered metabolic state, occurs in early postpartum dairy cattle when energy demands to support lactation exceed energy intake. During NEB the liver undergoes oxidative stress and increased breakdown of fatty acids accompanied by changes in gene expression. It is now known that micro RNAs (miRNA) can have a role in mediating such alterations in gene expression through repression or degradation of target mRNAs. miRNA expression is known to be altered by metabolism and environmental factors and miRNAs are implicated in expression modulation of metabolism related genes. Results miRNA expression was profiled in the liver of moderate yielding dairy cattle under severe NEB (SNEB) and mild NEB (MNEB) using the Affymetrix Gene Chip miRNA_2.0 array with 679 probe sets for Bos-taurus miRNAs. Ten miRNAs were found to be differentially expressed using the ‘samr’ statistical package (delta = 0.6) at a q-value FDR of < 12%. Five miRNAs including miR-17-5p, miR-31, miR-140, miR-1281 and miR-2885 were validated using RT-qPCR, to be up-regulated under SNEB. Liver diseases associated with these miRNAs include non-alcoholic fatty liver (NAFLD) and hepatocellular carcinoma (HCC). miR-140 and miR-17-5p are known to show differential expression under oxidative stress. A total of 32 down-regulated putative target genes were also identified among 418 differentially expressed hepatic genes previously reported for the same animal model. Among these, GPR37 (G protein-coupled receptor 37), HEYL (hairy/enhancer-of-split related with YRPW motif-like), DNJA1, CD14 (Cluster of differentiation 14) and GNS (glucosamine (N-acetyl)-6-sulfatase) are known to be associated with hepatic metabolic disorders. In addition miR-140 and miR-2885 have binding sites on the most down-regulated of these genes, FADS2 (Fatty acid desaturase 2) which encodes an enzyme critical in lipid biosynthesis. Furthermore, HNF3-gamma (Hepatocyte nuclear factor 3-gamma), a hepatic transcription factor (TF) that is involved in IGF-1 expression regulation and maintenance of glucose homeostasis is a putative target of miR-31. Conclusions This study shows that SNEB affects liver miRNA expression and these miRNAs have putative targets in hepatic genes down-regulated under this condition. This study highlights the potential role of miRNAs in transcription regulation of hepatic gene expression during SNEB in dairy cattle. Background Negative energy balance (NEB), an altered metabolic state, occurs in early postpartum dairy cattle when energy demands to support lactation exceed energy intake. During NEB the liver undergoes oxidative stress and increased breakdown of fatty acids accompanied by changes in gene expression. It is now known that micro RNAs (miRNA) can have a role in mediating such alterations in gene expression through repression or degradation of target mRNAs. miRNA expression is known to be altered by metabolism and environmental factors and miRNAs are implicated in expression modulation of metabolism related genes. Results miRNA expression was profiled in the liver of moderate yielding dairy cattle under severe NEB (SNEB) and mild NEB (MNEB) using the Affymetrix Gene Chip miRNA_2.0 array with 679 probe sets for Bos-taurus miRNAs. Ten miRNAs were found to be differentially expressed using the ‘samr’ statistical package (delta = 0.6) at a q-value FDR of < 12%. Five miRNAs including miR-17-5p, miR-31, miR-140, miR-1281 and miR-2885 were validated using RT-qPCR, to be up-regulated under SNEB. Liver diseases associated with these miRNAs include non-alcoholic fatty liver (NAFLD) and hepatocellular carcinoma (HCC). miR-140 and miR-17-5p are known to show differential expression under oxidative stress. A total of 32 down-regulated putative target genes were also identified among 418 differentially expressed hepatic genes previously reported for the same animal model. Among these, GPR37 (G protein-coupled receptor 37), HEYL (hairy/enhancer-of-split related with YRPW motif-like), DNJA1, CD14 (Cluster of differentiation 14) and GNS (glucosamine (N-acetyl)-6-sulfatase) are known to be associated with hepatic metabolic disorders. In addition miR-140 and miR-2885 have binding sites on the most down-regulated of these genes, FADS2 (Fatty acid desaturase 2) which encodes an enzyme critical in lipid biosynthesis. Furthermore, HNF3-gamma (Hepatocyte nuclear factor 3-gamma), a hepatic transcription factor (TF) that is involved in IGF-1 expression regulation and maintenance of glucose homeostasis is a putative target of miR-31. Conclusions This study shows that SNEB affects liver miRNA expression and these miRNAs have putative targets in hepatic genes down-regulated under this condition. This study highlights the potential role of miRNAs in transcription regulation of hepatic gene expression during SNEB in dairy cattle

    The miRNAome of the postpartum dairy cow liver in negative energy balance

    Get PDF
    peer-reviewedBackground: Negative energy balance (NEB) is an altered metabolic state in high yielding cows that occurs during the first few weeks postpartum when energy demands for lactation and maintenance exceed the energy supply from dietary intake. NEB can, in turn, lead to metabolic disorders and to reduced fertility. Alterations in the expression of more than 700 hepatic genes have previously been reported in a study of NEB in postpartum dairy cows. miRNAs (microRNA) are known to mediate many alterations in gene expression post transcriptionally. To study the hepatic miRNA content of postpartum dairy cows, including their overall abundance and differential expression, in mild NEB (MNEB) and severe NEB (SNEB), short read RNA sequencing was carried out. To identify putative targets of differentially expressed miRNAs among differentially expressed hepatic genes reported previously in dairy cows in SNEB computational target identification was employed. Results: Our results indicate that the dairy cow liver expresses 53 miRNAs at a lower threshold of 10 reads per million. Of these, 10 miRNAs accounted for greater than 95% of the miRNAome (miRNA content). Of the highly expressed miRNAs, miR-122 constitutes 75% followed by miR-192 and miR-3596. Five out of thirteen let-7 miRNA family members are also among the highly expressed miRNAs. miR-143, down-regulated in SNEB, was found to have 4 putative up-regulated gene targets associated with SNEB including LRP2 (low density lipoprotein receptor-related protein 2), involved in lipid metabolism and up-regulated in SNEB. Conclusions: This is the first liver miRNA-seq profiling study of moderate yielding dairy cows in the early postpartum period. Tissue specific miR-122 and liver enriched miR-192 are two of the most abundant miRNAs in the postpartum dairy cow liver. miR-143 is significantly down-regulated in SNEB and putative targets of miRNA-143 which are up-regulated in SNEB, include a gene involved in lipid metabolism.Teagasc Walsh Fellowship Programm

    Seq-ing improved gene expression estimates from microarrays using machine learning

    Get PDF
    BACKGROUND: Quantifying gene expression by RNA-Seq has several advantages over microarrays, including greater dynamic range and gene expression estimates on an absolute, rather than a relative scale. Nevertheless, microarrays remain in widespread use, demonstrated by the ever-growing numbers of samples deposited in public repositories. RESULTS: We propose a novel approach to microarray analysis that attains many of the advantages of RNA-Seq. This method, called Machine Learning of Transcript Expression (MaLTE), leverages samples for which both microarray and RNA-Seq data are available, using a Random Forest to learn the relationship between the fluorescence intensity of sets of microarray probes and RNA-Seq transcript expression estimates. We trained MaLTE on data from the Genotype-Tissue Expression (GTEx) project, consisting of Affymetrix gene arrays and RNA-Seq from over 700 samples across a broad range of human tissues. CONCLUSION: This approach can be used to accurately estimate absolute expression levels from microarray data, at both gene and transcript level, which has not previously been possible. This methodology will facilitate re-analysis of archived microarray data and broaden the utility of the vast quantities of data still being generated

    A plant natriuretic peptide-like gene in the bacterial pathogen Xanthomonas axonopodis may induce hyper-hydration in the plant host: a hypothesis of molecular mimicry

    Get PDF
    BACKGROUND: Plant natriuretic peptides (PNPs) are systemically mobile molecules that regulate homeostasis at nanomolar concentrations. PNPs are up-regulated under conditions of osmotic stress and PNP-dependent processes include changes in ion transport and increases of H(2)O uptake into protoplasts and whole tissue. PRESENTATION OF THE HYPOTHESIS: The bacterial citrus pathogen Xanthomonas axonopodis pv. Citri str. 306 contains a gene encoding a PNP-like protein. We hypothesise that this bacterial protein can alter plant cell homeostasis and thus is likely to represent an example of molecular mimicry that enables the pathogen to manipulate plant responses in order to bring about conditions favourable to the pathogen such as the induced plant tissue hyper-hydration seen in the wet edged lesions associated with Xanthomonas axonopodis infection. TESTING THE HYPOTHESIS: We found a Xanthomonas axonopodis PNP-like protein that shares significant sequence similarity and identical domain organisation with PNPs. We also observed a significant excess of conserved residues between the two proteins within the domain previously identified as being sufficient to induce biological activity. Structural modelling predicts identical six stranded double-psi β barrel folds for both proteins thus supporting the hypothesis of similar modes of action. No significant similarity between the Xanthomonas axonopodis protein and other bacterial proteins from GenBank was found. Sequence similarity of the Xanthomonas axonopodis PNP-like protein with the Arabidopsis thaliana PNP (AtPNP-A), shared domain organisation and incongruent phylogeny suggest that the PNP-gene may have been acquired by the bacteria in an ancient lateral gene transfer event. Finally, activity of a recombinant Xanthomonas axonopodis protein in plant tissue and changes in symptoms induced by a Xanthomonas axonopodis mutant with a knocked-out PNP-like gene will be experimental proof of molecular mimicry. IMPLICATION OF THE HYPOTHESIS: If the hypothesis is true, it could at least in part explain why the citrus pathogen Xanthomonas campestris that does not contain a PNP-like gene produces dry corky lesions while the closely related Xanthomonas axonopodis forms lesions with wet edges. It also suggests that genes typically found in the host, horizontally transferred or heterologous, can help to explain aspects of the physiology of the host-pathogen interactions

    FRAGS: estimation of coding sequence substitution rates from fragmentary data

    Get PDF
    BACKGROUND: Rates of substitution in protein-coding sequences can provide important insights into evolutionary processes that are of biomedical and theoretical interest. Increased availability of coding sequence data has enabled researchers to estimate more accurately the coding sequence divergence of pairs of organisms. However the use of different data sources, alignment protocols and methods to estimate substitution rates leads to widely varying estimates of key parameters that define the coding sequence divergence of orthologous genes. Although complete genome sequence data are not available for all organisms, fragmentary sequence data can provide accurate estimates of substitution rates provided that an appropriate and consistent methodology is used and that differences in the estimates obtainable from different data sources are taken into account. RESULTS: We have developed FRAGS, an application framework that uses existing, freely available software components to construct in-frame alignments and estimate coding substitution rates from fragmentary sequence data. Coding sequence substitution estimates for human and chimpanzee sequences, generated by FRAGS, reveal that methodological differences can give rise to significantly different estimates of important substitution parameters. The estimated substitution rates were also used to infer upper-bounds on the amount of sequencing error in the datasets that we have analysed. CONCLUSION: We have developed a system that performs robust estimation of substitution rates for orthologous sequences from a pair of organisms. Our system can be used when fragmentary genomic or transcript data is available from one of the organisms and the other is a completely sequenced genome within the Ensembl database. As well as estimating substitution statistics our system enables the user to manage and query alignment and substitution data

    Mining Mammalian Transcript Data for Functional Long Non-Coding RNAs

    Get PDF
    BACKGROUND: The role of long non-coding RNAs (lncRNAs) in controlling gene expression has garnered increased interest in recent years. Sequencing projects, such as Fantom3 for mouse and H-InvDB for human, have generated abundant data on transcribed components of mammalian cells, the majority of which appear not to be protein-coding. However, much of the non-protein-coding transcriptome could merely be a consequence of 'transcription noise'. It is therefore essential to use bioinformatic approaches to identify the likely functional candidates in a high throughput manner. PRINCIPAL FINDINGS: We derived a scheme for classifying and annotating likely functional lncRNAs in mammals. Using the available experimental full-length cDNA data sets for human and mouse, we identified 78 lncRNAs that are either syntenically conserved between human and mouse, or that originate from the same protein-coding genes. Of these, 11 have significant sequence homology. We found that these lncRNAs exhibit: (i) patterns of codon substitution typical of non-coding transcripts; (ii) preservation of sequences in distant mammals such as dog and cow, (iii) significant sequence conservation relative to their corresponding flanking regions (in 50% cases, flanking regions do not have homology at all; and in the remaining, the degree of conservation is significantly less); (iv) existence mostly as single-exon forms (8/11); and, (v) presence of conserved and stable secondary structure motifs within them. We further identified orthologous protein-coding genes that are contributing to the pool of lncRNAs; of which, genes implicated in carcinogenesis are significantly over-represented. CONCLUSION: Our comparative mammalian genomics approach coupled with evolutionary analysis identified a small population of conserved long non-protein-coding RNAs (lncRNAs) that are potentially functional across Mammalia. Additionally, our analysis indicates that amongst the orthologous protein-coding genes that produce lncRNAs, those implicated in cancer pathogenesis are significantly over-represented, suggesting that these lncRNAs could play an important role in cancer pathomechanisms

    MScanner: a classifier for retrieving Medline citations

    Get PDF
    BACKGROUND: Keyword searching through PubMed and other systems is the standard means of retrieving information from Medline. However, ad-hoc retrieval systems do not meet all of the needs of databases that curate information from literature, or of text miners developing a corpus on a topic that has many terms indicative of relevance. Several databases have developed supervised learning methods that operate on a filtered subset of Medline, to classify Medline records so that fewer articles have to be manually reviewed for relevance. A few studies have considered generalisation of Medline classification to operate on the entire Medline database in a non-domain-specific manner, but existing applications lack speed, available implementations, or a means to measure performance in new domains. RESULTS: MScanner is an implementation of a Bayesian classifier that provides a simple web interface for submitting a corpus of relevant training examples in the form of PubMed IDs and returning results ranked by decreasing probability of relevance. For maximum speed it uses the Medical Subject Headings (MeSH) and journal of publication as a concise document representation, and takes roughly 90 seconds to return results against the 16 million records in Medline. The web interface provides interactive exploration of the results, and cross validated performance evaluation on the relevant input against a random subset of Medline. We describe the classifier implementation, cross validate it on three domain-specific topics, and compare its performance to that of an expert PubMed query for a complex topic. In cross validation on the three sample topics against 100,000 random articles, the classifier achieved excellent separation of relevant and irrelevant article score distributions, ROC areas between 0.97 and 0.99, and averaged precision between 0.69 and 0.92. CONCLUSION: MScanner is an effective non-domain-specific classifier that operates on the entire Medline database, and is suited to retrieving topics for which many features may indicate relevance. Its web interface simplifies the task of classifying Medline citations, compared to building a pre-filter and classifier specific to the topic. The data sets and open source code used to obtain the results in this paper are available on-line and as supplementary material, and the web interface may be accessed at http://mscanner.stanford.edu
    corecore